The price of re-establishing perfect, almost perfect 
or public monitoring in games with arbitrary 

monitoring 

Mael Le Treust* and Samson Lasaulce* 
*Laboratoire des Signaux et Systemes, CNRS - Universite Paris-Sud 11 - Supelec, 91191, Gif-sur-Yvette Cedex, France 

Email: {letreust}, {lasaulce} @lss. supelec.fr 



(N 
O 

> 

O 

O: 

m 
(N 



> 

m 
d 



X: 



Abstract — This paper establishes a connection between the 
notion of observation (or monitoring) structure in game theory 
and the one of communication channels in Shannon theory. One 
of the objectives is to Itnow under which conditions an arbitrary 
monitoring structure can be transformed into a more pertinent 
monitoring structure. To this end, a mediator is added to the 
game. The objective of the mediator is to choose a signalling 
scheme that allows the players to have perfect, almost perfect or 
public monitoring and all of this, at a minimum cost in terms 
of signalling. Graph coloring, source coding, and channel coding 
are exploited to deal with these issues. A wireless power control 
game is used to illustrate these notions but the applicability 
of the provided results and, more importantly, the framework 
of transforming monitoring structures go much beyond this 
example. 

I. Introduction 

Observation or monitoring structures are omnipresent in 
games, especially in dynamic games. Monitoring structures 
specify what the players effectively observe. These observa- 
tions allow a given player to construct his private history, 
which is used, at a given instant, as an input of a function 
defining his strategy. For instance, observations may consist of 
action profiles (this is the case in repeated games with perfect 
monitoring |18| and fictitious play (31), arbitrary signals (this 
is the case in repeated games with public signals |1T9| and with 
an observation graph |14|), or realizations of the individual 
utility function (this is the case in stochastic games between 
learning automata ifTSI and repeated game with incomplete 
information |8|). The problem is that when players interact in 
a game with an arbitrary observation structure, the possible 
outcomes might turn out to be unpredictable and, even when 
they are, they might not have important properties such as 
Nash equilibria. To be concrete, the characterization of equilib- 
rium utilities in repeated games with an arbitrary observation 
structure is still an open problem |15|. In interactive situations 
where game theory is relevant like distributed power control 
in wireless networks ifTTI . it is common that terminals do not 
observe the transmit power levels of the other terminals (12], 
[?]. Being not able to predict all possible operating points for 
such a network may cause a problem for the network designer 
In particular, ensuring the existence of efficient Nash equilibria 
can be highly desirable when terminals implement learning 
algorithms with partial observations |20|. 

The above considerations show the importance of being able 



to transform a given monitoring structure into a new one. But, 
how can this be done? And at what price? This paper precisely 
falls in the general framework which consist in proposing 
solutions to implement such transformations and evaluating 
their cost in terms of signalling. As far as the provided results 
are concerned, the authors do not provide complete answers 
to these new questions. Indeed, the scope of this paper is as 
follows. First, one way to transform a monitoring structure 




Fig. 1. Interpreting the monitoring structure of a dynamic game as a 
communication problem. 



into a new one is to add a mediator (see Fig. |l| in the game: 
this mediator does not have a strategic role here and is only 
used for improving the observation capabilities of the players. 
Second, even if the initial monitoring structure (without the 
mediator) can be effectively arbitrary, the desired monitoring 
resulting from the addition of the mediator is assumed to be 
perfect, almost perfect or public, and therefore not arbitrary 
(the latter case is left as a significant extension of this work). 
In the example of distributed power control, the players would 
be the decisionnally autonomous terminals while the mediator 
would be a base station or a relay node. Whereas the ideas 
presented here seem seducing, the question is how to tackle 
this general problem. One of the contributions of this paper is 
to re-interpret observation structures in games as channels in 
communication theory. Exploiting this interpretation, several 
questions arise. Based on what the mediator observes, does 
there exist a source code (at the mediator) which allows the 
players to re-establish a perfect, almost perfect or public obser- 
vation of an information source (the action profiles typically)? 
What is the minimum cost of signalling to re-establish such an 
observation structure? Is the Shannon capacity |17| associated 



with the initial observation structure high enough to convey the 
required amount of signalling? Shannon theory [4| and graph 
theory \2\ brings appropriate answers to all these questions. 
As it will be seen, the connection we establish between game 
theory and Shannon theory opens many other interesting issues 
such as: proving some equilibrium utilities are impossible to 
reach in certain games because of limited channel capacities 
of the considered observation structure; defining new channels 
in communication theory from observation scenarios in game 
theory. 

We provide a characterization of compatible monitoring 
structure and a coding scheme that reconstruct e-Perfect Mon- 
itoring in Sec. [nil After computing the price of re-establishing 
the almost perfect monitoring (PREEPM) we investigate the 
reconstruction of Perfect Monitoring of the source in Sec. 
II VI and the one-shot reconstruction of the almost Perfect 
Monitoring in Sec. |V] We illustrate our results with the well- 
known "prisoner's dilemma" in Sec. |VT] The proof of the 
theorem are provided in the appendices |A] 

II. System Model 

The purpose of this section is twofold: to review some 
basic concepts and definitions from dynamic games, which are 
essential for understanding the subsequent sections; to state the 
general problem under investigation. Following the definition 
of Ba§ar and Olsder (11] pp. 205), a dynamic game consists in 
a sequence of stage games F = (^*)tgN« where at each stage 
i G N*, we have: 

= (/c,{n*}.«,KW,w*,/*, 

{•SDieK, {gDieic, {hljieic, {T^jieK) 

Denote K. = {l,...,K} the set of players constant along 
the game, T'l,...,V'jf are the corresponding sets of actions, 
7r*,...,7r^ are the payoff (or cost) functions, is the state 
parameter and /* is the state transition function, g*,...,;?^ 
are the private monitoring functions at stage t and 5* , . . . , Sj^ 
are the corresponding sets of private signals, h\,...,h*j^ are 
the private histories and tI,...,t^ are the strategy functions. 
Game stages correspond to time intervals at the beginning of 
which players can choose their actions. 

I ® 1' 




Fig. 2. The private monitoring channel. 



The strategic information is modeled by an information 
source where a{t) is produced by the source at stage t. 
This strategic information may consists in action profiles or 



arbitrary signals. We assume that, for a given game stage 
t > 1, each player i ^ JC knows and can take into account 
the past realizations of his private observation Si drawn from 
the private monitoring gi (see Fig. |ll|. Denote A(Z) the set 
of probabilities over the set Z. 

g,:A^ A(5,) (1) 

The main difference between static games and dynamic games 
is that players can take into account the sequence of past 
strategic signals in their long-run strategy. Increasing the 
amount of strategic information, increase the strategy space of 
the players. The vector h\ = (si(l), Si{t — 1)) is the private 
history of player i, at stage t and lies in the set "H* = {Sif^^. 
A strategy Ti for player z S /C is a sequence of strictly causal 
functions {n,t)t>i^ 

n.t -nl^rl (2) 

Let Ti be the set of strategies of player i G /C and r = 
(ti, ...^tk) be a joint strategy. 

We introduce an additive signalling structure called "the 
mediator assisted monitoring channel", represented in Fig. 
(3). It consist of a triple (W, m, /) where W denote the 
mediator, m the observation channel of the mediator and / 
the communication channel between the mediator and the 
players. The mediator also observes a noisy version q of 
the information source a. It's has to relay every relevant 
information to the players in order to make them monitors the 
information source. The observation channel of the mediator 
is defined as follows. Denote A the set of strategic information 
and Q the set of signals observed by the mediator. 

m:A^ A(g) (3) 

The communication channel between the mediator and the 
players where X is the set of channel inputs and Yi is the 
set of signals observed by player i e IC. 

f:X^A{Y,x Y2) (4) 

Thus at each stage t > 1 of the game, the players obtain a 
private observation s* and a mediator's signal yf. We investi- 
gates the properties of such an additive signalling structure in 
order to answer the question: Are the players able to observes 
the information source or not ? 

Denote a* the reconstructed version of the source by player 
i G /C. The course of the signalling process begins with the 
strategic information a, generated by the source at a given 
stage. The mediator W is assumed to have an imperfect ob- 
servation (namely q) of the symbols a generated by the source 
and knows the information structure of every player. Taking 
this knowledge into account, the mediator applies certain 
mathematical operations on what it observes and broadcasts 
a public signal x to all the players. Therefore, each player 
i G /C receives a private signal Si and an additional signal 
from the mediator denoted by yi. 




Fig. 3. The mediator-assisted monitoring channel. 



III. Reconstruction of the e-Perfect Monitoring 

In this section, we investigate the reconstruction of the 
£-Perfect Monitoring. We introduce an additive signalHng 
structure (W, m, /) which operates as a relay in order to send 
an additional signal to the players. We provide conditions over 
the additive signalling structure in order the players monitors 
almost perfectly the source of strategic information. We first 
recall the definition of e-perfect monitoring available in the 
literature ISj, ||9) and we present a "max-min formulation" to 
compute the error parameter e. Then we define properly the 
"reconstruction" of the e-perfect monitoring at the players. 
Based on a graph-coloring approach, we provide two con- 
ditions over the additive signalling structure (W, to, /) that 
are sufficient to reconstruct the e-perfect monitoring for the 
players. We call the first condition: "the (x, y)-coloring con- 
dition". It regards the observation function of the mediator m 
and it guarantee that the mediator can reconstruct the e-perfect 
monitoring. The second condition concerns the communication 
channel / between the mediator and the players and is called 
the "essential information condition". It guarantee that the 
capacity of the channel / allows the mediator to communicate 
the strategic information to the players. 

In this section, we investigate the reconstruction problem 
using the framework of Shannon |17|. We make the following 
assumptions on the information source, the private monitoring 
and the mediator assisted channel. 

• The information source is discrete and i.i.d. 

• The monitoring structure is stationary. 

• The players may tolerate a delay in the signalling. 
These assumption allow us to derive the fundamental limit 
derived by Shannon on the information transmission. The 
results, we present in this section, are based on the three 
above assumptions. However, the strategies of the players 
may not always satisfy this properties. We relax these three 
hypothesis in Sec. (Iy]l and we derive alternative limits over 
the information transmission. 

Definition 1: |5|, |9| A monitoring A : A — > S^) 
is e-perfect (or almost perfect) if for each player i G /C there 
exists a partition Ti — {T!f : a e A) of the signals such 
that for all a € A, 



E 



A(crj 



> 1 



(5) 



We characterize the precision of the monitoring using a "max- 
min formulation". 

Proposition 1: A monitoring A is e-perfect if and only if 

1— e = min max min > A(cr;|a) 
■i=> e = max min max > A(cr.;|a) 

ie/C Ti = (T°)„ a(iA ^ 

Proof: See Appendix lAl ■ 
After a joint action a is played, each player i ^ IC obtains 
a private signal Si drawn from a private monitoring gi. 



9i 



A —> A(5,) VieK 



(6) 



The mediator observes a signal q drawn from the observation 
channel m. 



A 



A(Q) 



(7) 



Then it send through the communication channel / an additive 
signal to each players. 



/ 



X 



(8) 



This communication procedure induces a pair of signals cTi = 
{si,yi) for each player where Si comes from the private moni- 
toring gi and yi comes from the additional signalling structure 
(W, TO, /). We derive conditions over the additional signalling 
structure (W,to, /) such that the joint signal ct,; = {si,yi) 
satisfies the e-perfect condition. 

A. Reconstruction of the e-Perfect Monitoring 

We define the notion of code in this framework. The 
mediator observes a sequence of signals q and reduce it to 
what we called the "essential information sequence" r using a 
graph coloring argument. Then it encodes the sequences r into 
a sequence x using a joint source-channel coding procedure. 

The players will decode the "essential information" r using 
the channel output yi and the private observation s^. This "es- 
sential information sequence" r combined with the sequence 
of private monitoring characterizes a unique sequence a of 
joint actions. 

Definition 2: A (n, h, (p, {ipi)i(^ic)-code is a pair of encod- 
ing functions for the mediator: 

h : Q — > R, "essential information" 

(j) : i?" — > X", "source-channel encoding" 

and a decoding function for each player: 

tpi : X S*" — > A",Vi e IC, "source-channel decoding" 

We quantify the precision of the joint signal a-,; = (si, J/i) 
using the following definition. 

Definition 3: The mediator can reconstruct the e-Perfect 
Monitoring if. 



V(5 > 0, 3(n, h, (p, ('0i)ieK)-pi"ocess such that, 



r 



A(CT,|a) > 1 



>1-S 



For a given private monitoring structure {gi)ieK, we provide 
sufficient conditions over the additive signalling structure 
(W, m, f) such that the mediator can reconstruct the e-perfect 
monitoring. Two natural questions arises : When the mediator 
observation function m is sufficiently precise to guarantee the 
£-perfect monitoring at the players ? When the communication 
channel / between the mediator and the players allows to 
transmit all the relevant information ? 

We provide an answer to the first question using the {x, y)- 
coloring condition in the next subsection (IIII-Bl l. The second 
question will be investigate in subsection ( Illl-Cb using the 
concept of "rate of essential information". 

B. The {x,y)-coloring Condition 

We define the {x, y)-coloring condition in order to charac- 
terize the observation functions m of the mediator that are 
compatible with every private monitoring cji of the players 
i e /C. This condition is based on a graph-coloring approach. 
We represent the private monitoring using an auxiliary graph 
(see Def. |5]l whose vertices are the joint actions a. There is 
an edge e between two vertices a and a' if both joint action 
induce the same signal s,; with large probability. 

The main idea is the following. If the observation of the 
mediator m is a coloring of the auxiliary graphs, then the 
information m passing through the mediator is completely 
orthogonal to the private information cji. Thus every joint 
actions can be distinguished by the players and the e-perfect 
monitoring can be reconstructed. 

Definition 4: Define the equivalence classes of actions for 
each of the private monitoring g,; with i G AC as follows. 

G,(a) = {s, e 5„ g,{sM) > 1/2}, (9) 
a h ^ G,{a) ^ G,{h) (10) 

Denote Ag^ = {a^} the partition of A into equivalence classes 



with respect to the relation 
monitoring m. 



-. In the same way with the 



M{b) = {qeQ, m{q\b)> 1/2}, (11) 
a r^rn M{a) = M{b) (12) 

Denote Am — {(Xm) the partition of A into equivalence 
classes with respect to the relation ~.,„. These equivalence 
classes induce a family of auxiliary monitoring defined by. 



9i 



a,; 



and 



171 : A„ 



igi{s\a))aeai 

A(Q)I^I 

{m{q\a))aea^ 



(13) 
(14) 



(15) 
(16) 



The precision of the auxiliary monitoring gi and m are 
computed in the following way. Let {Sa}a£Ag. a partition 
of the signals s of player i indexed by the equivalence classes 
a € Ag.. Define in the same way {Qp} peA,^ a partition of 



the signals q of mediator indexed by the equivalence classes 

/? e Am- 



max mm mm 



max min min > m{q\a) ~ v 

peA^ aep ^ ^ 



(17) 
(18) 



The monitoring gi is Xi-perfect and rh is y-perfect. 

Definition 5: The auxiliary graph of player i E K., denoted 
Qi — {A,Ei) is defined as follows. 



3e, = (a, b) e E, 



(19) 



Inspired from graph coloring we define the following concept 
of (cc, y)-coloring. 

Definition 6: The monitoring gi and m satisfy an {x,y)- 
coloring condition if : 

> The auxiliary monitoring gi is x perfect, 

> The auxiliary monitoring m is y perfect, 

« The partition {Qp}peArn induced by the auxiliary mon- 
itoring ?fi is a coloring c : A — Q of the graph Qi. 

Remark that the last condition is equivalent to the following 
one: the auxiliary monitoring gi is a coloring of the graph Qm 
defined by = (a, b) G Em <J==> a ~m b. 

C. The Rate of Essential Information 

We define the rate of essential information in order to 
characterize the channels / between the mediator and the 
players that are compatible with the amount of information the 
players need. It could happened that the observation channel 
m of the mediator satisfy the above (a;, j/)-coloring condition, 
but not all the information q is relevant. 

In this subsection, we aim at reducing the relevant in- 
formation to it's minimum. To do so, we use a second 
coloring condition over a bi-auxiliary graph Q to eliminate 
any redundant information between the signals q and s,;. We 
call the "essential information" the sequence r corresponding 
to a concatenation of the sequence of signals q. 

Definition 7: The bi-auxiliary graph Q = {Q,E) is defined 
as follows. 



3e = {q,q')eE 



3i e K., 3a, be A, s.t. (20) 

q e m(a), (21) 

q' G m{b), (22) 

a b (23) 



Definition 8: Let h : Q — )■ R the minimal coloring of the 
bi-auxiliary graph Q and denote the random variable r essential 
information drawn from the distribution h®m®p such that 

-PW = Ea,gP(a)™(9|a)'*(''k)- 
Define the essential rate as follows. 



H — max H {r\si) 



(24) 



where the random variable Si is drawn from the transition 
Tr-R — > MS,) with. 



T.(s|r) = 



'Ea,qPi(^)M<l\a)Hr\q)gt{s\a) 



(25) 



(26) 



Ea,9 J2a.^qPia)m{q\a)h{r\q) 

In the following, such a mapping h is called recoloring of 
monitoring m. The following coding theorem for broadcast 
channel with common messages fTOl provides us an upper 
bound for transmits to the players the strategic information. 
Theorem 1 (Korner, Marton 1977 HJW): The capacity Cq 



of the broadcast channel / 
common messages is exactly. 



X 



Cq — max mm I(X:Yi) 

peA(X) ieic 



A(n.«^o with 



(27) 



The coding theorem we present is constructed over large 
blocs of strategic signals. Its implies that the players may 
tolerate a delay in the reconstruction of the e-perfect moni- 
toring. This assumption is relaxed in section (|V|i below and 
an alternative result is presented. 

D. Main Result 

We provide two conditions that ensure the additive sig- 
nalling structure (W, m, /) is compatible with the recon- 
struction of the £-perfect monitoring. The first condition is 
based on the (a;, ?/)-coloring condition (see subsection JIII-Bb ) 
and guarantees that the mediator is sufficiently informed 
to help the players reconstruct the desired monitoring. The 
second condition is based on the "essential information" (see 
subsection jlll-Ct ) and ensures that the additional information 
the mediator obtains, is compatible with the communication 
constraints of the channel between the mediator and the 
players. 

Condition (1) : There exists a pair {x, y) such that x + y — 
xy < e and for each player i E IC, the private monitoring 
gi and the monitoring of the mediator m satisfy an {x,y)- 
coloring condition. 

Condition (2) : The essential rate H satisfy H < Cq, the 
capacity Cq of the channel / with common messages. 

Theorem 2 (e-PM): Fix a strategy profile p e A (A), a 
monitoring structure M — {m, {gi)i^ic, /) and an e > 0. 
If the monitoring structure A4 satisfy conditions (1) and (2), 
then the mediator can reconstruct the e-Perfect Monitoring. 
Proof: The proof is detailed in Appendix |A] ■ 

We provide conditions over the additive signalling structure 
(W, TO, /) that are sufficient to reconstruct the e-perfect moni- 
toring for the players. Note that a complete characterization is 
not available due to the problem of characterizing the precision 
of a two parallel monitoring functions. 

We obtain a set of admissible additive signalling structure 
{yV, TO, /) and we need an evaluation method to choose the 
best admissible additive signalling structure (>V, to, /) in term 
of signalling cost. For that reasons, we introduce the price of 
re-establishing e-perfect monitoring as the ratio between the 



number of bits of the additive signalling and the number of 
bits of the source of strategic information. 

Definition 9: Define the price of re-estabhshing e-Perfect 
Monitoring: 



PREEPM°°(£) = 



max^gK: H{R\Si) 
H{A) 



(28) 



The worst case correspond to the situation where the mediator 
directly send the entire sequence of joint actions a. In that case 
the price is equal to 1. Obviously, the players would have all 
the strategic information and they can reconstruct the moni- 
toring perfectly. However, this situation is not very interesting 
from our point of view since the capacity constraints between 
the mediator and the players may forbid the transmission of 
the strategic information. 

Finding the minimal price of re-establishing e-perfect mon- 
itoring is equivalent to finding the optimal admissible additive 
signalling structure (>V,to, /). 

IV. Reconstruction of the Public Monitoring 

The problem of strategic observation are well studied in the 
framework of repeated game with public monitoring. In this 
section, we assume that the source of strategic information is 
no more a joint action but a public signal. For example, if the 
public signal we consider satisfies the "individual and pairwise 
full rank conditions" of [6|, then the set of the equilibria is 
fully characterized even if the game is stochastic. We extend 
our results to the perfect reconstruction of the information 
source without error (i.e. where e = 0). We provide sufficient 
and necessary conditions on the additional signalling structure 
W for being compatible with the reconstruction of the perfect 
monitoring. 

A. The "Painting" Condition 

The main difference here is the precision of the monitoring 
of the information source: e = 0. We provide here a necessary 
and sufficient condition over the observation function to of 
the mediator such as reconstruct the perfect monitoring of the 
source of strategic information. This condition is also based 
on graph coloring and we called it "the painting condition" in 
reference to C. Berge. 

We construct a graph where the vertices are the public 
signals a. There is an edge between to publics signals a and a' 
if the same private signal Si is drawn with positive probability. 
We prove that the observation of the mediator is orthogonal 
to the private monitoring if and only if the observations q of 
the mediator is a coloring of the graph. 

Definition 10: Denote the sets of possible signals. 

G,(a) = {s, G 5„ 5,(s» > 0}, Vz e /C (29) 
M{b) = {q€Q, m(q\b) > 0} (30) 

Definition 11: The auxiliary graph of player i G /C, denoted 
Qi — {A,Ei) is defined as follows: 



3e, = (a, b)eE,^ G,{a) n G,(6) ^ 



(31) 



We define the concept of painting of a graph Q as a 
correspondence m : A ^ Q if every selection ffi : A ^ Q of 
m is a coloring of the graph Q. 

Definition 12: The monitoring of the mediator to is a 
painting of the family of graphs {Gi)ieK. induced by the private 
monitoring {gi)ieK if for alH G /C we have 



3e, = {a,h)^E, 



m{a) n m{b) = 



(32) 



B. Main Result 



As in the previous section, we provide two conditions (over 
TO and /) such that the additive signalling structure (W, to, /) 
is compatible with the reconstruction of the perfect monitoring. 
This result is stronger than the previous one because we 
provide necessary and sufficient conditions. 

Definition 13: Define the following conditions: 
Condition (1') : The monitoring of the mediator m is a 
painting of the family of graphs {Gi)ieK- 
Condition (2) : The essential rate H satisfies H < Co, the 
capacity Co of the channel / with common messages. 

Theorem 3 (PM): Fix a strategy profile p £ A (A) and 
monitoring structure Ai — (to, {gi)i^ic, f)- 
The mediator can reconstruct the Perfect Monitoring for 
Strategy p if and only if the monitoring structure A4 satisfy 
conditions (1') and (2). 

Proof: The proof is detailed in Appendix [A] 

■ 

We obtain a set of admissible additive signalling structure 
(W, TO, /) and we introduce the price of re-establishing per- 
fect monitoring in order to evaluate the performance of the 
reconstruction. 

Definition 14: Define the price of re-establishing Perfect 
Monitoring: 



PRPM°°(e) = 



(33) 



Finding the minimal price of re-establishing e-perfect mon- 
itoring is equivalent to finding the optimal admissible additive 
signalling structure (W, to, /) for reconstruct the perfect mon- 
itoring of the information source. 

V. One-shot Reconstruction of the e-Perfect 
Monitoring 

In the previous sections, we have assumed that the source of 
strategic information was i.i.d., the channel was stationary and 
the players tolerate a delay before reconstructing the desired 
monitoring. In this section, we relax these three hypothesis 
and we investigate a "one-shot" reconstruction of the e- 
perfect monitoring. Note that, the techniques we develop in 
this section also apply to the reconstruction of the perfect 
monitoring. 

Once the strategic information is drawn, the mediator pro- 
vides an additional information to the players before the end of 
the game stage. The definition of the one-shot reconstruction 
consists in replacing the number n of stages by 1 in the 
definition of the long term reconstruction in SecHIIl 



A. The Condition z-Perfect 

The main difference regards the communication channel / 
between the mediator and the players. Assuming the "one- 
shot" reconstruction prevent us to use the classical coding 
scheme from Shannon theory. We introduce the condition z- 
perfect in order to characterize the channels / (see Fig. 3) 
between the mediator and the players compatible with the one 
shot reconstruction of the e perfect monitoring. 

Definition 15: The channel between the mediator and each 
player is z-Perfect if, for each players, there exists a partition 
Yi = {Y^}reR of the signals indexed by the set of essential 
information R such that for all r £ R: 



E 



f{y^\r) >l-z 



(34) 



B. Main Result 



The result of this section is widely based on the one in 
section dllll i. Define as above the bi-auxiliary graph Q, and 
the mapping h : Q — > R is called recoloring of monitoring 
TO. The condition (2) with the entropy inequality of theorem 
(|2]l is replaced by the z-perfect condition (2'). 

Definition 16: Define the following conditions: 
Condition (1) : For each player i E IC, the private monitoring 
gi and the monitoring of the mediator m satisfy an {x,y)- 
coloring condition. 

Condition (2') : The channel / between the mediator and each 
player is z-Perfect. 

Theorem 4 (e-PM): Fix a strategy profile p e a 
monitoring structure A4 = {'m:{gi)ieKi f) ™d an e > 0. 
If the monitoring structure M satisfy conditions (1) and (2') 
with. 



x-'ty + z — xy — xz — zy-\' xyz < e 



(35) 



Then the mediator can reconstruct the e-Perfect Monitoring in 
one-shot. 

Proof: The proof is detailed in Appendix lAl ■ 
We provide conditions over the additive signalling structure 
{W, TO, /) that are sufficient to reconstruct the e-perfect mon- 
itoring in one-shot. Remark that a complete characterization 
is not available. 

In order to evaluate the best additive signalling structure 
(W, TO, /), we introduce the price of one-shot re-establishing 
the e-Perfect Monitoring. In the one-shot case, the entropy 
H{A) is replaced by log 

Definition 17: Define the price of one-shot re-establishing 
£-Perfect Monitoring: 



PREEPM = 



log|j?l 
log|A| 



(36) 



Finding the minimal price of one-shot re-establishing e- 
perfect monitoring is equivalent to finding the optimal admis- 
sible additive signalling structure (W, m, /). 



VI. Prisoner's Dilemma 

We consider a simple wireless power control game where 
our result may direclty apply. Following the framework of 
Goodman and Mandayan fT], we consider a decentralized 
multiple access channel where the players choose their power 
control policy in order to maximize their energy efficiency. 

We consider a two player power control game where the 
actions are the transmit power pi and p2- The energy-efficiency 
utility is defined as follows: 

/(SINK, 



[bit/ J] 



i 6 IC 



(37) 



where the function f{x) is sigmoidal (here we take f{x) — 
(1 — e^^)*^. The SINR at receiver i £ /C writes as: 

SINR, = , , (38) 



with parameters \gi\'^ for the channel gain and for the noise 
variance. 

In our previous work over this communication model lfT2l . 
we investigate two interesting power levels. The first is the 
power of the Nash equilibrium denoted p* and the second is 
the power of the operating point which provide a Pareto 
optimal utility. The power p* and pi are defined respectively 
in equation (4) and (9) of the article [12|. 

In order to illustrate our results, we provide an complete 
analysis of a simple example which can be easily generalized. 
We consider a two player power control game where only 
two power levels {p*,pi) are available to the players. Fix 
the parameters of the power control game for the random 
CDMA case. The number of players K — 2, the number of 
symbols M = 2, the spreading factor N ^ 2, the channel 
gains |(7ip — |g2p — 1 and the noise variance <t^ = 1. 
The set of achievable utility is described by the following 
payoff matrix. The utility pair (0.10, 0.34) mean that the utility 
of player 1 is 0.10 and 0.34 is the utility of player 2. The 
region of achievable utility using pure and mixed strategies is 
represented by the quadrilateral on figure (4). 
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0.23,0.23 


0.10,0.34 


0.34,0.10 


0.15,0.15 



the players' actions. Denote the private monitoring of player 
i, gi : A — > Si with precision parameters x' < x. 




Denote Ag^ , Ag^ the equivalence classes of private moni- 
toring gi and 92 over the actions A. 

Ai = {{piP2,pipl);{p\p2,plp*2)}^{a,a'} (39) 

A2 = {{PIP2.PIP2)-,{P1P*2.PIP*2)}^{P,P'} (40) 

These equivalence classes induce a pair of auxiliary monitor- 
ing denoted 



91 ■■ M 

a 

92 ■ A2 

Taking the partitions S, 



A(5i)l^l 

{gi[s\a))aea 

A(52)l^l 

{92{s\a))aef} 



(41) 
(42) 
(43) 
(44) 



si ; Sa' = s'l and ^ S2 ; 
3/3' = S2 we calculate the precision of the auxiliary monitoring 
gi and 52 ■ 



min min > oifsla) = x 

aeAi aiEa ^ ' ^ 

s6Sc 

min min > q2(s\a) = x 

seSfs 



(45) 
(46) 



The monitoring gi is x-perfect and 52 is a;'-perfect. 

The monitoring graphs corresponding to the above equiva- 
lence classes of the private observations are. 
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P1P2 



Remark that this game is strategically equivalent to the 
Prisonner's Dilemma where the Nash equilibrium correspond 
to the joint action {pl,P2) and the social optimal action 
correspond to {pi,p2)- We consider as an example a private 
monitoring structure gi , g2 as defined below. We fix the 
additive signalling structure (W, to, /), an i.i.d. mixed strategy 
p £ A (A) and we prove it allow the reconstruction of the e- 
perfect monitoring. We compute the price of re-establishing 
equilibrium conditions. 

The source of strategic information in this case, represents 
the sequence of actions of the players. The actions are sup- 
posed to be drawn i.i.d. from a distribution over the set of 



Note that by construction, each of those graph is an union 
of complete graphs. 

The mediator observes a signal drawn from m : A — > Q. 

Denote the equivalence classes of the monitoring to of the 
mediator over the actions A. 

A„, = {{PlP2)■,{PlPhp:P2)■,{p:p*2))} = {l,l'^"¥^) 

These equivalence classes induce an auxiliary monitoring 
denoted 

m:Am A(g)'^l (48) 

7 — > {m{q\a)) (49) 
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^93 



Taking the partitions = qi, Q^' = q2 and Q^" = we 
calculate the precision of the auxiliary monitoring m. 



mm mm 



E 



■m{q\a) = y 



(50) 



The monitoring to is y-perfect. 

In order to decide whether the mediator can reconstruct the 
desired monitoring, let us check if the auxiliary monitoring 
TO of the mediator is a coloring of the graphs Qg-^ and Qg-^ of 
the players. To illustrate this, we associate the colors blue, red 
and green to respectively qi, q2 and q^. For each player i £ 



^2. * 
■•P1P2 



PlP2,^ 



IP2 



PlP2f 



P1P2 



92. , 

mP2 



«PlP2 



JC, the pair of auxiliary monitoring {m,gi) satisfy an {x,y)- 

coloring condition (recall that x' < x). Thus, the mediator gets 

sufficient information to reconstruct the e-perfect monitoring 

at the players with e — x + y — xy. 

To extract the essential information from mediator's signal 

q without decreasing the precision of the monitoring, let 

us introduce the following bi-auxiliary graph. The coloring 

ri 
91 • 



92 



193 



''2 



h : Q — > R of the above bi-auxiliary graph characterizes 
the essential information the mediator should give to the 
player in order to re-establish the e-perfect monitoring. Recall 
that this essential information is optimal in the sense of the 
cardinality of R and of the precision of the monitoring. It 
cannot be reduced without introducing a larger ambiguity 
between action profiles for at least one player. The process 
of strategic information is described as follows: 

For each player i e /C, the pair of auxiliary monitoring {ho 
TO, gi) still satisfy an (x, 2/)-coloring condition and is moreover 
minimal in term of cardinality \R\. To reconstruct the e-perfect 
monitoring at the player, the mediator will send the common 
information r such that each player, knowing the private signal 
Si can reconstruct the right action profile with probability more 
than 1 — £. 



P1P2 



P1P2 




^93 93 
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Suppose now that player 1 plays a mixed strategy (2/3,1/3) 
and player 2 plays a mixed strategy (2/3, 1/3). Assume from 
now that they play repeatedly following this mixed strategy. A 
sequence of action profiles is generated from the distribution 
p G A{A) and it leads to the payoff vector (0.22,0.22) (see 
Fig. (4)). 

P2 P2 
Pi ' ' 

P*l 
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2/9 


2/9 


1/9 



The entropy of such a distribution source is H{a) = log 9 — 
4/3 ~ 1.8366. Fix the noise level of the transitions functions 
at X = = y = 1/10. The process of information generated 
a source of essential information {ri,r2) with distribution 
(49/90,41/90) of entropy H{r) ~ 0.9943. To transmit the 
source of essential information, the mediator considers the side 
information Si from the transition channel of player i. 



7i(s|ri) 



Ea.,P(a,g,r,g) 
Ea,,PK9,r-) 
Ea.qP(g)"^(9|a)^(^l9)g»(g|a) 
T,a,qJ2a.,qPio-)rni'l\a)hir\q) 

The transition matrix of the channel are evaluated. 



Ti(si|ri) 
Ti(s[\n) 
Ti(si|r2) 
Ti(s[\r2) 



353/490 
137/490 
217/410 
193/410 



(51) 
(52) 



(53) 
(54) 
(55) 
(56) 



We represent it as a binary channel. The channel transition 

353/490 




217/410 



of player 2 is also characterized and is found to be identical. 
Using the Slepian and Wolf binning scheme, the entropy of 
the essential information with side information writes as : 



H = m&xH{R\Si 



(57) 



In this case, we have p{ri\si) = 353/570, p(r2\s'i) = 
217/330, p{r2\si) = 193/570 and piri\s[) = 137/330 and 
Ti = T2. The minimal information rate sent by the mediator 
to both players is 



H = H{R\Si) = H{R.\S2) ^ 0.9451 



(58) 



Under the condition that the rates pair (H, H) belong to the 
capacity region of the channel between the mediator and the 
players, the mediator can reconstruct the e-perfect monitoring 
at the players with precision e = x + y — xy— 19/100. 
The price of re-establishing e-Perfect Monitoring writes: 



PREEPM" 



maxjgA: H{r\si) 0.9451 



H{a) 



1.8366 



0.5145 (59) 



Taking the same monitoring structure with a noise level 
a; = a;' = y = 0, we investigate the noiseless version of 
the reconstruction of the perfect monitoring. The price of re- 
estabUshing Perfect Monitoring becomes: 



PRPM° 



maxjg^g(r|sj) 
H{a) 



0.5 



(60) 



We conclude that in the noiseless problem, the private monitor- 
ing structure provides almost half the information needed by 
the players to reconstruct the source of strategic information. 
Whereas in the noisy case, the additional monitoring structure 
is in charge by almost 51.5 % of the reconstruction the 
strategic source of information. 

VII. Concluding remarks 

The monitoring problem of strategic information is ad- 
dressed in this paper. Taking into account the private monitor- 
ing structure, a mediator is introduced in order to re-establish 
e-perfect monitoring at the players. In order to evaluate the 
signaUng cost for the mediator, the problem of the recon- 
structing a strategic information is re-interpreted as a channel 
of communication theory. Graph theory and Shannon theory 
are respectively exploited to provide a characterization of the 
admissible monitoring structure and analyze their efficiency 
in term of "price of re-establishing e-Perfect Monitoring" 
(PREEPM). A coding theorem is provided for the channels 
where the mediator observes the source imperfecdy and the 
strategic information is drawn from an i.i.d. source. Chal- 
lenging open problems appear when considering a source of 
information generated by an arbitrary stochastic process. For 
example, in the case of imperfect monitoring of past actions, 
the players can choose an appropriate sequence of actions 
such as to manipulate the coding schemes. Another interesting 
extension is to consider a mediator that sends private messages 
to the players instead of common messages. It would also be 
of interest to provide conditions for changing an imperfect 
monitoring structure into another imperfect monitoring 
structure M' (not necessarily perfect or almost perfect). 
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APPENDIX 

By definition, e is the minimum admissible value such that: 

3 T = (Ta)a, Va £ A, ^ AC^Ja) > 1 - e 



3 T = (Ta)a , 



J2 A(^i|a)>l- 



Taking the minimum admissible value for e, the monitoring 
A is e-perfect if and only if there is equality in the above 
equation. 

Proof: We will prove that the conditions (1) and (2) are sufficient. 
The first conditions (1) states that there exists a pair (x, y) such that x + 
y — xy < e and for each player i & JC, the private monitoring gi and the 
monitoring of the mediator m satisfy an (x, j/)-coloring condition. Taking 
now the minimal coloring of the bi-auxiliary graph h : Q — > R, we will 
show that for every player i E K., the joint monitoring {gi, horn) is x+y—xy 
perfect. 

Fix a player i K. Let {Qp}p^A be the partition of signals q € Q 
indexed by the equivalence classes with respect to the monitoring m 

and {Sa}aeAg. the partition of signals s G Si indexed by the equivalence 



classes of A with respect to the monitoring g^. By hypothesis, {Q/jj/jgAm 
is a coloring of the graph Gg^. We first show that (h{Qfi))fi^A,„ is still a 
coloring of the graph Gg^ . Take a and b two neighbor nodes of the graph Gg^ . 
By the coloring property, the sets of associated color Qa (a) and Qa (b) are 
disjoint. Thus each pair of colors q G Qa{<i) and q' £ Qa{b) are neighbor 
in the bi-auxiliary graph. The coloring h : Q — > R of the bi-auxiliary graph 
implies that (/i(Qa))asAm is still a coloring of the graph Gg^. Second, the 
coloring property implies that the following product Ta = Sa {a)xh{Qa{a)) 
defines a partition (Ta)„gA of T. Let us calculate the precision of such a 
joint monitoring. For all strategic information a £ A: 

= ,r-eTa seSo(a) 9 E (o ) r £ h ( Q ^ (a) ) 

> Yl Si^la) Y "(ll") 

> (1 - - x) = 1 - (x + H - xy) 

Thus, for each player i a K., the monitoring {gi,hom) satisfies an a; + j/ — 
xy-Perfect Monitoring condition. It remains to transmit that signal over the 
broadcast channel with common messages /. The condition (2) states that the 
essential rate H satisfies H < Co, the capacity Co of the broadcast channel / 
with common messages |10|. The joint source-channel coding theorem states 
that there exists appropriate mappings: 



■>Pi 



■ X'' 



(61) 
(62) 



such that 
messages 
correctly 
r") > 1 ■ 
symbol r 
an strategi 
by X + y 



transmitting the source r over the broadcast channel with common 
/ is possible with an eiTor probability < S. The above mappings 
transmit every sequence r" with probability more than Vir" = 
— S. When the sequence r" is correctly decoded, at each stages the 
combined with the side symbol Si for each player i, are associated to 
ic information profile a where the stage error probability is bounded 
xy. We proved that 



= {T^ 



e A}, Va e A, Y 



Proof: First we show that conditions (1') and (2) are sufficient. The 
monitoring of the mediator m is a painting of the family of graphs {Qi)i£ic- 
This imphes that for each player i £ K, for each pair of strategic information 
a, 6 £ A, if the private signal as a positive probability to be the same, then 
the signal observed by the mediator will distinguish them. 



gi{a) n gi{b) ^ m(a) n m{b) ■ 



(63) 



Moreover, the recoloring h : Q 

i G K, 

g^{a) n gi{b) H) 
q G m{a), q' G m(b) 



R keeps this property. For all player 



h{q) h(q') ^r^r' 



(64) 
(65) 



Condition (1') implies that, for each player i G /C, the pair of information 

(si, r) is sufficient to reconstruct the Perfect Monitoring. 

Condition (2) states that H < Co which implies that the rate of this 

information r is lower than the capacity of the channel between the mediator 

and the player i G /C. Thus, by the source-channel coding theorem, we have 

that: 



Proof: We ever show that the condition (1) is sufficient to reconstruct 
X + y — xi;-Perfect Monitoring in one shot. We show that if the family of 
channels {fi)i^)c between the mediator and each player satisfy an z-perfect 
condition (condition (2')), then each player monitors with a precision at least 
of x+y+z — xy — xz — yz+xyz. Let us calculate the precision of such a joint 
monitoring received by player i is : yl — > Si xYi. For all joint strategic 
information a G A and for each player i £ IC, the (x, j/)-coloring property 
guarantees the existence of a partition defined by = X Qp X Yf such 
that a a a, a a P and r G h{Qp). The precision of the joint monitoring is 
upper bounded by, 

Yl Ai(s,»|a) 
Yl 9i(=l») J2 12 H fi(vlr-)h(r\q),r,(q\a) 

Y, 9i(s|a) Y. '"(<!|f) T. JiivM 

ses»^ qeQ/j vevr 

> (1 - - x)(i - ^) 

= 1— (x + y + s — xy — X2 — 2y + xyz) 



For each player i G IC, the joint monitoring 
x + y + z — xy — xz — yz + xyz perfect. 



idi) fi oho m) is 



Nash Equilibrium 
Operating Point 
Deviation Player 2 
Deviation Player 1 
Mixed Strategy 




0.2 0.25 
Utility for Player 1 [biVJ] 

Fig. 4. Nash Equilibrium, Operating Point and the Deviation Utihties for 
{K,M,N) = (2,2,2) 



Ve > 0, 3(n, h, 4>, {^l!i)i^|c)-pl'ocess such that, < e 



(66) 



This implies that the mediator can reconstruct the Peii'ect Monitoring. Second, 
we show that conditions (1') and (2) are necessary. Suppose that condition 
(2) does not hold. Then, by the source-channel coding theorem of Merhav 
and Shamai (2003 [1311), it is impossible to transmit the source r over the 
channel / with low error probability. The mediator cannot reconstruct the 
Perfect Monitoring. 

Suppose that condition (1') does not hold. Then, there exists a player i and a 
pair of strategic information a, b that have the same color s and there exists 
an edge e = (a,b). This implies that with positive probability player i will 
observe a private signal r and a public signal s when strategic information a 
or b is drawn. Then, the mediator cannot reconstruct the Peii'ect Monitoring. 



